Instance Selection: A Bayesian Decision Theory Perspective

نویسندگان

چکیده

In this paper, we consider the problem of lacking theoretical foundation and low execution efficiency instance selection methods based on k-nearest neighbour rule when processing large-scale data. We point out that core idea these can be explained from perspective Bayesian decision theory, is, to find which instances are reducible, irreducible, deleterious. Then, percolation establish relationship between three types local homogeneous cluster (i.e., a set with same labels). Finally, propose method an accelerated k-means algorithm construct clusters remove superfluous instances. The performance our is studied extensive synthetic benchmark data sets. Our proposed handle more effectively than state-of-the-art methods. All code results available at https://github.com/CQQXY161120/Instance-Selection.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Optimal instance selection for improved decision tree

Instance selection plays an important role in improving scalability of data mining algorithms, but it can also be used to improve the quality of the data mining results. In this dissertation we present a new optimization-based approach for instance selection that uses a genetic algorithm (GA) to select a subset of instances to produce a simpler decision tree with acceptable accuracy. The result...

متن کامل

A Unied Bayesian Decision Theory

This paper provides new foundations for Bayesian Decision Theory based on a representation theorem for preferences de…ned on a set of prospects containing both factual and conditional possibilities. This use of a rich set of prospects not only provides a framework within which the main theoretical claims of Savage, Ramsey, Je¤rey and others can be stated and compared, but also allows for the po...

متن کامل

A Minimal Extension of Bayesian Decision Theory

Savage denied that Bayesian decision theory applies in large worlds. This paper proposes a minimal extension of Bayesian decision theory to a large-world context that evaluates an event E by assigning it a number π(E) that reduces to an orthodox probability for a class of measurable events. The Hurwicz criterion evaluates π(E) as a weighted arithmetic mean of its upper and lower probabilities, ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2022

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v36i6.20578